Video stream playing control method and apparatus, and storage medium

ABSTRACT

Embodiments of the present application provide a video stream playing control method and apparatus, a device, and a storage medium. The method comprises: in a situation that a human-computer interaction entrance invoking triggering condition is satisfied, stopping playing a first video stream, and invoking a human-computer interaction entrance for the first video stream while displaying a playing picture of the first video stream; after target feedback information is obtained by means of the human-computer interaction entrance or an instruction for closing the human-computer interaction entrance is received, closing the human-computer interaction entrance, and playing a second video stream different from the first video stream, the target feedback information comprising feedback information for the first video stream.

CROSS REFERENCE OF RELATED APPLICATION

The present application is a continuation application of InternationalPatent Application No. PCT/CN2021/077971, titled “VIDEO STREAM PLAYINGCONTROL METHOD AND APPARATUS, AND STORAGE MEDIUM”, filed on Feb. 25,2021, which claims priority to Chinese Patent Application No.202010287736.7, titled “VIDEO STREAM PLAYING CONTROL METHOD ANDAPPARATUS, AND STORAGE MEDIUM”, filed on Apr. 13, 2020 with the ChinaNational Intellectual Property Administration, both of which areincorporated herein by reference in their entireties.

FIELD

Embodiments of the present disclosure relate to the technical field ofvideo stream control, and in particular to a method for controllingplaying of a video stream, a device, and a storage medium.

BACKGROUND

With the development of Internet technology, video stream applicationprograms running on terminal devices are widely used. In theconventional technology, feedback information such as satisfaction of auser on the video stream is generally collected based on whether theuser completely watches the video stream. However, this kind of feedbackinformation is low in accuracy, and is not fully utilized in theconventional technology.

SUMMARY

A method for controlling playing of a video stream, a device, and astorage medium are provided according to embodiments of the presentdisclosure, so as to pertinently and accurately acquire user feedback onthe video stream that is currently played, and avoid affecting the videoplaying effect due to a human-computer interaction interface blockingthe playing video stream.

In a first aspect, a method for controlling playing of a video stream isprovided according to an embodiment of the present disclosure. Themethod includes: stopping playing a first video stream when a triggercondition for invoking a human-computer interaction portal is satisfied,and invoking the human-computer interaction portal for the first videostream while displaying a playing window of the first video stream; andclosing the human-computer interaction portal and playing a second videostream different from the first video stream on receipt of targetfeedback information through the human-computer interaction portal or onreceipt of an instruction to close the human-computer interactionportal. The target feedback information includes feedback informationabout the first video stream.

In the method according to the embodiment of the present disclosure, thehuman-computer interaction portal for the first video stream is invokedwhile displaying the playing window of the first video stream, toacquire feedback information, so as to accurately and pertinentlyacquire the user feedback on the first video stream that is currentlyplayed. Moreover, during the process of invoking the human-computerinteraction portal, the playing of the video stream is paused (i.e., thefirst video stream is stopped playing), and the video stream is resumedplaying (i.e., the second video stream is played) after thehuman-computer interaction portal is closed. Even if the human-computerinteraction portal blocks the playing window of the first video stream,the playing effect is not affected by blocking the video stream that iscurrently played because the playing of the first video stream isstopped during this process. In addition, the playing window of thestopped first video stream is still displayed since the first videostream is stopped playing, thereby prompting the user that the invokedhuman-computer interaction portal is for the first video stream, and theprompt is intuitive and simple.

Optionally, the second video stream different from the first videostream may be played in a manner that includes but is not limited to,automatically playing the second video stream in response to acquisitionof the target feedback information or in response to the instruction toclose the human-computer interaction portal.

In the method according to the embodiment of the present disclosure, thevideo stream is resumed playing in response to the acquisition of thetarget feedback information or the instruction to close thehuman-computer interaction portal. That is, the applicationautomatically plays the second video stream after the user inputs thetarget feedback information or closes the human-computer interactionportal, thereby achieving a seamless transition between human-computerinteraction and automatic continuation of playing as well as improvingthe smoothness of video playing.

On the basis of any one of the described method embodiments, the secondvideo stream played and/or at least one video stream to be played isdetermined based on the target feedback information.

In the method according to the embodiment of the present disclosure, thevideo stream recommendation is performed in real time based on thetarget feedback information. The faster the operation speed of therecommendation model based on the target feedback information, thefaster the video stream recommendation is performed based on the targetfeedback information in order to resume playing the video stream,thereby improving the real-time performance of the video recommendationand the user experience. If the human-computer interaction portal isused to acquire the user interest in the first video stream, the targetfeedback information may truly reflect the user interest in the firstvideo stream. Therefore, real-time video stream recommendation based onthe target feedback information may also improve the accuracy of videorecommendation.

In addition, in the method according to the embodiment of the presentdisclosure, it is necessary to acquire richer feedback content in orderto make video stream recommendation in real time based on the targetfeedback information. For example, multiple feedback questions are setor multiple options are set for a feedback question, which requires arelatively larger display area. As described above, in the methodaccording to the embodiment of the present disclosure, the playing ofthe video stream is stopped when the human-computer interaction portalis invoked. Therefore, in the case where the human-computer interactionportal is a human-computer interaction interface, the playing effect ofthe video stream is not affected by blocking the video stream beingplayed even if a relatively large human-computer interaction interfaceis displayed.

On the basis of any one of the method embodiments, the trigger conditionfor invoking the human-computer interaction portal may include at leastone of: a video stream switching instruction is received; the number oftimes that the first video stream is played repeatedly reaches athreshold; playing of the first video stream is finished already; aspecific target object is detected during the playing of the first videostream; and the first video stream is played to a certain time instant.

In practice, the method according to the embodiment of the presentdisclosure supports flexible configuration of the trigger condition forinvoking the human-computer interaction portal, so as to meet productrequirements.

In one embodiment, the terminal device may configure the received videostream switching instruction as the trigger condition for invoking thehuman-computer interaction portal. When receiving the video streamswitching instruction, the terminal device invokes the human-computerinteraction portal in response to the instruction. This implementationmanner of invoking the human-computer interaction portal does notinterrupt the first video stream being played, thereby reducing theinterference to the playing of the video stream.

In one embodiment, the terminal device may configure the completion ofplaying the first video stream as the trigger condition for invoking thehuman-computer interaction portal. When the playing of the first videostream is finished, the terminal device invokes the human-computerinteraction portal. This implementation of invoking the human-computerinteraction portal does not interrupt the first video stream beingplayed, thereby reducing the interference to the playing of the videostream.

Taking a short video playing application as an example, in anon-automatic playing mode, the playing control process of a videostream is commonly as follows. If no video stream switching instructionis detected, the current video stream is played repeatedly. When a videostream switching instruction is detected, the current video stream isswitched to a next video stream. For the scenario where the currentvideo stream is played repeatedly with no video stream switchinginstruction being received, the repetition of the current video streammay be resulted from that the user attention is diverted and the userforgets to switch or close the video stream rather than means that theuser is more interested in the content of the video stream. In thiscase, in one embodiment, the terminal device may configure the number ofrepetitions of the first video stream reaching the threshold as thetrigger condition for invoking the human-computer interaction portal. Ifthe number of times that the first video stream is played repeatedlyreaches the threshold, the terminal device requests the user feedbackinformation about the video stream by invoking the human-computerinteraction portal. This can not only improve the accuracy of thefeedback information, but also improve the accuracy of subsequent use ofthe feedback information, for example, improve the accuracy of videostream recommendation based on the feedback information provided by theuser.

In a second aspect, a terminal device is provided according to anembodiment of the present disclosure. The terminal device may include afirst playing control module and a second playing control module.

The first playing control module is configured to stop playing a firstvideo stream when a trigger condition for invoking a human-computerinteraction portal is satisfied, and invoke the human-computerinteraction portal for the first video stream while displaying theplaying window of the first video stream.

The second playing control module is configured to close thehuman-computer interaction portal and play a second video streamdifferent from the first video stream on receipt of a target feedbackinformation through the human-computer interaction portal or aninstruction to close the human-computer interaction portal. The targetfeedback information includes feedback information about the first videostream.

With the terminal device according to the embodiment of the presentdisclosure, the human-computer interaction portal for the first videostream is invoked while the playing window of the first video stream isdisplayed, to acquire feedback information, so as to accurately andpertinently acquire the user feedback on the first video stream that iscurrently played. Moreover, during the process of invoking thehuman-computer interaction portal, the playing of the video stream ispaused (i.e., the first video stream is stopped playing), and the videostream is resumed playing (i.e., the second video stream is played)after the human-computer interaction portal is closed. Even if thehuman-computer interaction portal blocks the playing window of the firstvideo stream, the playing effect is not affected by blocking the videostream that is currently played because the playing of the first videostream is stopped during this process. In addition, the playing windowof the stopped first video stream is still displayed since the firstvideo stream is stopped playing, thereby prompting the user that theinvoked human-computer interaction portal is for the first video stream,and the prompt is intuitive and simple.

Optionally, the second playing control module plays the second videostream different from the first video stream in a manner that includes,but is not limited to, automatically playing the second video stream inresponse to acquisition of the target feedback information or inresponse to the instruction to close the human-computer interactionportal.

The terminal device according to the embodiment of the presentdisclosure resumes playing the video stream in response to theacquisition of the target feedback information or the instruction toclose the human-computer interaction portal. That is, the applicationautomatically plays the second video stream after the user inputs thetarget feedback information or closes the human-computer interactionportal without manual trigger by the user, thereby achieving a seamlesstransition between human-computer interaction and automatic continuationof playing as well as improving the smoothness of video playing.

On the basis of any one of the described terminal device embodiments,the second video stream played and/or at least one video stream playedis determined based on the target feedback information.

The terminal device according to the embodiment of the presentdisclosure performs video stream recommendation in real time based onthe target feedback information. The faster the operation speed of therecommendation model based on the target feedback information, thefaster the video stream recommendation is performed based on the targetfeedback information in order to resume playing the video stream,thereby improving the real-time performance of the video recommendationand the user experience. If the human-computer interaction portal isused to acquire the user interest in the first video stream, the targetfeedback information may truly reflect the user interest in the firstvideo stream. Therefore, real-time video stream recommendation based onthe target feedback information may also improve the accuracy of videorecommendation.

In addition, with the terminal device according to the embodiment of thepresent disclosure, it is necessary to acquire richer feedback contentin order to make video stream recommendation in real time based on thetarget feedback information. For example, multiple feedback questionsare set or multiple options are set for a feedback question, whichrequires a relatively larger display area. As described above, with theterminal device according to the embodiment of the present disclosure,the playing of the video stream is stopped when the human-computerinteraction portal is invoked. Therefore, in the case where thehuman-computer interaction portal is a human-computer interactioninterface, the playing effect of the video stream is not affected byblocking the video stream being played even if a relatively largehuman-computer interaction interface is displayed.

On the basis of any one of the terminal device embodiments, the triggercondition for invoking the human-computer interaction portal may includeat least one of: a video stream switching instruction is received; thenumber of times that the first video stream is played repeatedly reachesa threshold; playing of the first video stream is finished already; aspecific target object is detected during the playing of the first videostream; and the first video stream is played to a certain time instant.

In practice, the terminal device according to the embodiment of thepresent disclosure supports flexible configuration of the triggercondition for invoking the human-computer interaction portal, so as tomeet product requirements.

In one embodiment, the terminal device may configure the received videostream switching instruction as the trigger condition for invoking thehuman-computer interaction portal. When receiving the video streamswitching instruction, the terminal device invokes the human-computerinteraction portal in response to the instruction. This implementationmanner of invoking the human-computer interaction portal does notinterrupt the first video stream being played, thereby reducing theinterference to the playing of the video stream.

In one embodiment, the terminal device may configure the completion ofplaying the first video stream as the trigger condition for invoking thehuman-computer interaction portal. When the playing of the first videostream is finished, the terminal device invokes the human-computerinteraction portal. This implementation of invoking the human-computerinteraction portal does not interrupt the first video stream beingplayed, thereby reducing the interference to the playing of the videostream.

Taking a short video playing application as an example, in thenon-automatic playing mode, the playing control process of a videostream is commonly as follows. If no video stream switching instructionis detected, the current video stream is played repeatedly. When a videostream switching instruction is detected, the current video stream isswitched to a next video stream. For the scenario where the currentvideo stream is played repeatedly with no video stream switchinginstruction being received, the repetition of the current video streammay be resulted from that the user attention is diverted and the userforgets to switch or close the video stream rather than means that theuser is more interested in the content of the video stream. In thiscase, in one embodiment, the terminal device may configure the number ofrepetitions of the first video stream reaching the threshold as thetrigger condition for invoking the human-computer interaction portal. Ifthe number of times that the first video stream is played repeatedlyreaches the threshold, the terminal device requests the user feedbackinformation about the video stream by invoking the human-computerinteraction portal. This can not only improve the accuracy of thefeedback information, but also improve the accuracy of subsequent use ofthe feedback information, for example, improve the accuracy of videostream recommendation based on the feedback information provided by theuser.

In a third aspect, an electronic device is provided according to anembodiment of the present disclosure. The electronic device includes aprocessor and a memory. The memory is configured to store a program forimplementing the method described in the various method embodiments ofthe first aspect. The processor is configured to execute the programstored in the memory.

With the electronic device according to the embodiment of the presentdisclosure, the human-computer interaction portal for the first videostream is invoked while the playing window of the first video stream isdisplayed, to acquire feedback information, so as to accurately andpertinently acquire the user feedback on the first video stream that iscurrently played. Moreover, during the process of invoking thehuman-computer interaction portal, the playing of the video stream ispaused (i.e., the first video stream is stopped playing), and the videostream is resumed playing (i.e., the second video stream is played)after the human-computer interaction portal is closed. Even if thehuman-computer interaction portal blocks the playing window of the firstvideo stream, the playing effect is not affected by blocking the videostream that is currently played because the playing of the first videostream is stopped during this process. In addition, the playing windowof the stopped first video stream is still displayed since the firstvideo stream is stopped playing, thereby prompting the user that theinvoked human-computer interaction portal is for the first video stream,and the prompt is intuitive and simple.

In a fourth aspect, a computer-readable storage medium is providedaccording to an embodiment of the present disclosure. Thecomputer-readable storage medium stores a program for implementing themethod described in the various embodiments of the first aspect.

With the program stored in the computer-readable storage mediumaccording to the embodiment of the present disclosure, thehuman-computer interaction portal for the first video stream is invokedwhile the playing window of the first video stream is displayed, toacquire feedback information, so as to accurately and pertinentlyacquire the user feedback on the first video stream that is currentlyplayed. Moreover, during the process of invoking the human-computerinteraction portal, the playing of the video stream is paused (i.e., thefirst video stream is stopped playing), and the video stream is resumedplaying (i.e., the second video stream is played) after thehuman-computer interaction portal is closed. Even if the human-computerinteraction portal blocks the playing window of the first video stream,the playing effect is not affected by blocking the video stream that iscurrently played because the playing of the first video stream isstopped during this process. In addition, the playing window of thestopped first video stream is still displayed since the first videostream is stopped playing, thereby prompting the user that the invokedhuman-computer interaction portal is for the first video stream, and theprompt is intuitive and simple.

With the method for controlling playing of a video stream, the deviceand the storage medium according to the embodiments of the presentdisclosure, video stream recommendation is performed in real time basedon the target feedback information.

In a fifth aspect, a video stream playback control method is providedaccording to an embodiment of the present disclosure. The methodincludes: invoking a human-computer interaction portal for a first videostream that is currently played when the trigger condition for invokingthe human-computer interaction portal is satisfied, so as to acquiretarget feedback information through the human-computer interactionportal, where the target feedback information includes feedbackinformation about the first video stream; and playing a second videostream different from the first video stream after the target feedbackinformation is acquired. The second video stream and/or at least onevideo stream to be played is determined based on the target feedbackinformation.

With the method according to the embodiment of the present disclosure,the target feedback information can be acquired accurately andpertinently, and the video stream recommendation can be performed inreal time based on the target feedback information. The faster theoperation speed of the recommendation model based on the target feedbackinformation, the faster the video stream recommendation is performedbased on the target feedback information in order to resume playing thevideo stream, thereby improving the real-time performance of the videorecommendation and the user experience. If the human-computerinteraction portal is used to acquire the user interest in the firstvideo stream, the target feedback information may truly reflect the userinterest in the first video stream. Therefore, real-time video streamrecommendation based on the target feedback information may also improvethe accuracy of video recommendation.

Optionally, the playing, after the target feedback information isacquired, the second video stream different from the first video streamin a manner that includes but is not limited to, automatically playingthe second video stream in response to acquisition of the targetfeedback information.

In method according to the embodiment of the present disclosure, thesecond video stream is automatically played in response to acquisitionof the target feedback information, which can realize seamlesstransition between the human-computer interaction and the operation ofautomatically playing the video stream, and improve the smoothness ofvideo playing. Moreover, in a case that the second video stream isdetermined based on the target feedback information, the user feedbackcan be quickly responded according to the embodiments of the presentdisclosure, thereby further improving the user experience.

Optionally, in the method according to the embodiment of the presentdisclosure, the first video stream is stopped playing when the triggercondition for invoking the human-computer interaction portal issatisfied. Correspondingly, the implementation of invoking thehuman-computer interaction portal may include invoking thehuman-computer interaction portal while displaying the playing window ofthe first video stream. In addition, after receiving the instruction toclose the human-computer interaction portal, the human-computerinteraction portal is closed, and the second video stream is played.

In the method according to the embodiment of the present disclosure, thehuman-computer interaction portal for the first video stream is invokedto acquire feedback information while displaying the playing window ofthe first video stream, so as to accurately and pertinently acquire theuser feedback on the first video stream that is currently played. Inaddition, during the process of invoking the human-computer interactionportal, the playing of the video stream is paused (i.e., the first videostream is stopped playing), and the video stream is resumed playing(i.e., the second video stream is played) after the human-computerinteraction portal is closed. Even if the human-computer interactionportal blocks the playing window of the first video stream, the playingeffect is not affected by blocking the video stream that is currentlyplayed because the playing of the first video stream is stopped duringthis process. In addition, the playing window of the stopped first videostream is still displayed since the first video stream is stoppedplaying, thereby prompting the user that the invoked human-computerinteraction portal is for the first video stream, and the prompt isintuitive and simple.

In a sixth aspect, a terminal device is provided in real time accordingto the present disclosure. The terminal device includes a human-computerinteraction portal invocation module and a video stream playing module.

The human-computer interaction portal invocation module is configured toinvoke a human-computer interaction portal for a first video stream thatis currently played when the trigger condition for invoking thehuman-computer interaction portal is satisfied, so as to acquire targetfeedback information through a human-computer interaction interface. Thetarget feedback information includes feedback information about thefirst video stream. The video stream playing module is configured toplay a second video stream different from the first video stream afterthe target feedback information is acquired. The second video streamand/or at least one video stream to be played is determined based on thetarget feedback information.

The terminal device according to the embodiment of the presentdisclosure can acquire the target feedback information accurately andpertinently, and perform the video stream recommendation in real timebased on the target feedback information. The faster the operation speedof the recommendation model based on the target feedback information,the faster the video stream recommendation is performed based on thetarget feedback information in order to resume playing the video stream,thereby improving the real-time performance of the video recommendationand the user experience. If the human-computer interaction portal isused to acquire the user interest in the first video stream, the targetfeedback information may truly reflect the user interest in the firstvideo stream. Therefore, real-time video stream recommendation based onthe target feedback information may also improve the accuracy of videorecommendation.

Optionally, the video stream playing module plays the second videostream different from the first video stream after the target feedbackinformation is acquired in a manner including but not limited to:automatically playing the second video stream in response to acquisitionof the target feedback information.

The terminal device according to the embodiment of the presentdisclosure automatically plays the second video stream in response toacquisition of the target feedback information, which can realizeseamless transition between the human-computer interaction and theoperation of automatically playing the video stream, and improve thesmoothness of video playing. Moreover, in a case that the second videostream is determined based on the target feedback information, the userfeedback can be quickly responded according to the embodiments of thepresent disclosure, thereby further improving the user experience.

Optionally, the video stream playing module further stops playing thefirst video stream when the trigger condition for invoking thehuman-computer interaction portal is satisfied. Correspondingly, theimplementation manner of invoking the human-computer interaction portalby the human-computer interaction portal invocation module may includeinvoking the human-computer interaction portal while the playing windowof the first video stream is displayed. In addition, after receiving theinstruction to close the human-computer interaction portal, thehuman-computer interaction portal vocation module closes thehuman-computer interaction portal, and the video stream playing moduleplays the second video stream. The target feedback information includesfeedback information about the first video stream.

The terminal device according to the embodiment of the presentdisclosure stops playing the video stream (i.e., stops playing the firstvideo stream) during the process of invoking the human-computerinteraction portal, and automatically resumes playing the video stream(i.e., plays the second video stream) after closing the human-computerinteraction portal. Even if the human-computer interaction portal blocksthe playing window of the first video stream, the playing effect is notaffected by blocking the video stream that is currently played becausethe playing of the first video stream is stopped during this process. Inaddition, the playing window of the stopped first video stream is stilldisplayed since the first video stream is stopped playing, therebyprompting the user that the invoked human-computer interaction portal isfor the first video stream, and the prompt is intuitive and simple.

In a seventh aspect, an electronic device is further provided accordingto an embodiment of the present disclosure. The electronic deviceincludes a processor and a memory. The memory is configured to store aprogram for implementing the method described in the various methodembodiments of the fifth aspect. The processor is configured to executethe program stored in the memory.

The electronic device according to the embodiment of the presentdisclosure can acquire the target feedback information accurately andpertinently, and perform the video stream recommendation in real timebased on the target feedback information. The faster the operation speedof the recommendation model based on the target feedback information,the faster the video stream recommendation is performed based on thetarget feedback information in order to resume playing the video stream,thereby improving the real-time performance of the video recommendationand the user experience. If the human-computer interaction portal isused to acquire the user interest in the first video stream, the targetfeedback information may truly reflect the user interest in the firstvideo stream. Therefore, real-time video stream recommendation based onthe target feedback information may also improve the accuracy of videorecommendation.

In an eighth aspect, a computer-readable storage medium is furtherprovided according to an embodiment of the present disclosure. Thecomputer-readable storage medium stores a program for implementing themethod described in the various embodiments of the fifth aspect.

With the program stored in the computer-readable storage mediumaccording to the embodiment of the present disclosure, the targetfeedback information can be acquired accurately and pertinently, and thevideo stream recommendation can be performed in real time based on thetarget feedback information. The faster the operation speed of therecommendation model based on the target feedback information, thefaster the video stream recommendation is performed based on the targetfeedback information in order to resume playing the video stream,thereby improving the real-time performance of the video recommendationand the user experience. If the human-computer interaction portal isused to acquire the user interest in the first video stream, the targetfeedback information may truly reflect the user interest in the firstvideo stream. Therefore, real-time video stream recommendation based onthe target feedback information may also improve the accuracy of videorecommendation.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a flowchart illustrating a method for controlling playing of avideo stream according to an embodiment of the present disclosure;

FIG. 2 is a schematic diagram illustrating display of a human-computerinteraction interface and a video stream playing window interfaceaccording to an embodiment of the present disclosure;

FIG. 3A is a schematic diagram illustrating display of thehuman-computer interaction interface and the video stream playing windowinterface according to another embodiment of the present disclosure;

FIG. 3B is a schematic diagram illustrating display of the video streamplaying window interface after a display ratio is restored according toanother embodiment of the present disclosure;

FIG. 4 is a flowchart illustrating a method for controlling playing of avideo stream according to another embodiment of the present disclosure;

FIG. 5 is a block diagram illustrating a terminal device according to anembodiment of the present disclosure;

FIG. 6 is a block diagram illustrating a terminal device according toanother embodiment of the present disclosure;

FIG. 7 is a schematic structural diagram illustrating an electronicdevice according to an embodiment of the present disclosure.

DETAILED DESCRIPTION OF EMBODIMENTS

In order to better understand the above technical solutions, thetechnical solutions of the embodiments of the present disclosure aredescribed in detail below through the drawings and specific embodiments.It should be understood that the embodiments of the present disclosureand the drawings in the embodiments are detailed descriptions of thetechnical solutions of the embodiments of the present disclosure, ratherthan limitations of the technical solutions of the present disclosure.In the case of no conflict, the embodiments of the present disclosureand the technical features in the embodiments may be combined with eachother.

A method for controlling playing of a video stream is provided accordingto an embodiment of the present disclosure, and is suitable for videoplaying scenarios in which video data is streamed, including but notlimited to, a short video playing scenario, a long video playingscenario, a live video scenario, and the like. The video stream includesvideo data for data transmission by means of streaming. A manner inwhich the video stream is segment is not limited herein. In practice,segmentation of the video stream is defined according to productrequirements, so as to distinguish a first video stream, a second videostream, and so on. In an example rather than a limitation, in thescenario of playing a fixed-length video file (including a short videoor other forms of video), a video file may be defined as a video stream.Alternatively, a video file is divided into several video clips in apredetermined manner (for example, a movie video file may be dividedinto several video clips according to development of the plot), and eachvideo clip serves as a video stream. In a live video scenario, theentire video playing process may be divided into multiple video clipsaccording to the live content or duration, and each video clip serves asa video stream.

The method for controlling playing of a video according to theembodiment of the present disclosure may be implemented by a terminaldevice installed with a browser or a video stream playing application.As shown in FIG. 1 , the method includes the following steps 101 to 104.

In step 101, the first video stream is stopped playing when a triggercondition for invoking a human-computer interaction portal is satisfied.

In step 102, the human-computer interaction portal is invoked while aplaying window of the first video stream is displayed.

Although in the above description, invoking the human-computerinteraction portal is executed under the condition that the triggercondition for invoking the human-computer interaction portal issatisfied, the embodiment of the present disclosure does not limit thesequence of stopping playing the first video stream and invoking thehuman-computer interaction portal. In practice, the user generally doesnot perceive the sequence of stopping playing the first video stream andinvoking the human-computer interaction portal.

The human-computer interaction portal includes, but is not limited to,at least one of: a human-computer interaction interface and ahuman-computer interaction voice module. The human-computer interactionvoice module is capable of at least playing a voice and acquiring avoice, and further capable of voice recognition or sending a collectedvoice signal and acquiring a voice recognition result.

In practice, the trigger condition for invoking the human-computerinteraction portal is flexibly configured according to the productrequirements.

In step 103, the human-computer interaction portal is closed on receiptof target feedback information received through the human-computerinteraction portal or an instruction to close the human-computerinteraction portal.

In step 104, a second video stream different from the first video streamis played. The target feedback information includes feedback informationabout the first video stream.

In the above description, the second video stream is played afteracquiring the target feedback information through the human-computerinteraction portal or receiving an instruction to close thehuman-computer interaction portal. However, the embodiment of thepresent disclosure does not limit the sequence of playing the secondvideo stream and closing the human-computer interaction portal. Inpractice, the user generally does not perceive the sequence of playingthe second video stream and closing the human-computer interactionportal.

In a case that the human-computer interaction portal is a human-computerinteraction interface, the implementation of invoking the human-computerinteraction portal while displaying a playing window of the first videostream includes but is not limited to: displaying the human-computerinteraction interface 202 on a video stream playing window interface 201as shown in FIG. 2 , or displaying the video stream playing windowinterface 201 in a first display area 301 after being compressed(equally or unequally) and displaying the human-computer interactioninterface 202 in a second display area 302 as shown in FIG. 3 . Adisplay area of the terminal device includes the first display area 301and the second display area 302. The playing window (not shown in thedrawings) of the first video stream is displayed in the video streamplaying window interface of the browser or application. Therefore, theplaying window (not shown in the drawings) of the first video stream iscompressed along with the compression of the video stream playing windowinterface 201.

In a case that the video stream playing window interface is compressedand displayed during the displaying of the human-computer interactioninterface, the display ratio of the video stream playing windowinterface 201 is restored when the second video stream is played asshown in FIG. 3B. That is, the display ratio of the playing window (notshown in the drawings) of the second video stream is restored.

The second video stream may be automatically played immediately afterthe target feedback information is acquired or the instruction to closethe human-computer interaction portal is received. Alternatively, afterthe target feedback information is acquired or the instruction to closethe human-computer interaction portal is received, the second videostream is played in response to a video stream playing instruction.

In the method according to the embodiment of the present disclosure, thehuman-computer interaction portal for the first video stream is invokedwhile displaying the playing window of the first video stream, toacquire feedback information, so as to accurately and pertinentlyacquire the user feedback on the first video stream that is currentlyplayed. Moreover, during the process of invoking the human-computerinteraction portal, the playing of the video stream is paused (i.e., thefirst video stream is stopped playing), and the video stream is resumedplaying (i.e., the second video stream is played) after thehuman-computer interaction portal is closed. Even if the human-computerinteraction portal blocks the playing window of the first video stream,the playing effect is not affected by blocking the video stream that iscurrently played because the playing of the first video stream isstopped during this process. In addition, the playing window of thestopped first video stream is still displayed since the first videostream is stopped playing, thereby prompting the user that the invokedhuman-computer interaction portal is for the first video stream, and theprompt is intuitive and simple. Additionally, the playing window is notclosed or jumped. Therefore, it is unnecessary to open or jump to theplaying window in order to display the second video stream, whichensures smoothness of playing the video stream.

In practice, there are many ways to acquire the target feedbackinformation through the human-computer interaction portal. In oneimplementation, the human-computer interaction portal includes ahuman-computer interaction interface. On the human-computer interactioninterface, feedback request information for the first video stream isdisplayed, and a target feedback information input control is alsodisplayed. The target feedback information is acquired through thetarget feedback information input control. The target feedbackinformation input control may be a text box control, a selectioncontrol, a voice signal acquisition control, an image acquisitioncontrol, and the like.

The embodiments of the present disclosure do not limit the specificcontent and expression form of the feedback request information. Theexpression form of the feedback request information may include agraphic control, a text control, a selection control and so on. Thecontent of the target feedback information may include, but is notlimited to, a request to rate the first video stream, a question aboutwhether the first video stream is interested, an investigation into theplaying fluency of the first video stream, an investigation of theplaying quality of the first video stream, and so on, which is notlimited herein.

In practice, after the terminal invokes the human-computer interactionportal, the user may refuse to input a target feedback information. Inan example rather than a limitation, in the case that the user performsthe behavior of refusing to input the target feedback information, theterminal device generates an instruction to close the human-computerinteraction portal in response to the user behavior. The video streamingbrowser or application closes the human-computer interaction portal inresponse to the instruction. In one implementation, for example, thehuman-machine interaction portal includes a human-machine interactioninterface on which feedback request information for the first videostream is displayed, and a control indicating to close the interface anda control indicating to ignore the feedback request information are alsodisplayed. If the user has an action that triggers the controlindicating to close the interface or an action that triggers the controlindicating to ignore the feedback request information, the triggeraction serves as an instruction to close the human-computer interactionportal. Alternatively, the terminal device generates an instruction toclose the human-computer interaction portal in response to the triggeraction. The video streaming browser or application closes thehuman-computer interaction interface in response to the instruction. Inanother implementation, for example, the human-computer interactionportal includes a human-computer interaction interface on which feedbackrequest information for the first video stream is displayed. The videostreaming browser or application captures an image of the user faceand/or gesture through an image capture module of the terminal device.In a case that the user acts an expression, a look or a gestureindicating to close the interface, or acts an expression, a look or agesture indicating to ignore the feedback request information, theaction serves as an instruction to close the human-computer interactionportal. Alternatively, the terminal device generates an instruction toclose the human-computer interaction portal in response to the action.The video streaming browser or application closes the human-computerinterface in response to the instruction. In another implementation, forexample, the human-computer interaction portal includes a human-computerinteraction interface on which feedback request information for thefirst video stream is displayed. The video streaming browser orapplication captures a user voice signal through a voice acquisitionmodule of the terminal device and recognizes the voice signal to acquirea voice recognition result. In a case that the voice recognition resultindicates to close the human-computer interaction portal or indicates toignore the feedback request, the voice recognition result serves as aninstruction to close the human-computer interaction portal.Alternatively, the terminal device generates an instruction to close thehuman-computer interaction portal in response to the voice recognitionresult, and the video streaming browser or application closes thehuman-computer interaction interface in response to the instruction. Inone implementation, for example, the human-computer interaction portalincludes a voice human-computer interaction speech module. Thehuman-computer interaction speech module is invoked to play the feedbackrequest information in the form of voice for the first video stream andto capture a user voice signal in order to recognize the voice signal toacquire a voice recognition result. In a case that the voice recognitionresult indicates to close the human-computer interaction portal orindicates to ignore the feedback request information, the voicerecognition result serves as an instruction to close the human-computerinteraction portal. Alternatively, the terminal device generates aninstruction to close the human-computer interaction portal in responseto the voice recognition result, and the video streaming browser orapplication closes the human-computer interaction portal in response tothe instruction. In an implementation, for example, the human-computerinteraction portal includes a voice human-computer interaction speechmodule. The voice human-computer interaction speech module is invoked toplay a feedback request information in the form of voice for the firstvideo stream. The video streaming browser or application captures animage of the user face and/or gesture via an image capture module of theterminal device. In a case that the user acts an expression, look orgesture indicating to close the interface, or acts an expression, lookor gesture indicating to ignore the feedback request information, theaction serves as an instruction to close the human-computer interactionportal. Alternatively, the terminal device generates an instruction toclose the human-computer interaction portal in response to the action,and the video streaming browser or application closes the human-computerinteraction portal in response to the instruction.

After the user inputs the target feedback information and submits theinputted target feedback information, the terminal device acquires thetarget feedback information through the human-computer interactionportal. Accordingly, the acquisition of the target feedback informationmay be considered as an instruction to close the human-computerinteraction portal. Alternatively, the terminal device generates aninstruction to close the human-computer interaction portal in responseto this acquisition, and the video streaming browser or applicationcloses the human-computer interaction portal in response to thisinstruction.

In a case that the user neither inputs the target feedback informationnor refuses to input the target feedback information, a wait is set inorder not to affect the video streaming. If the wait expires, thehuman-computer interaction portal is closed and the video stream isresumed playing. In this case, the wait timeout serves as an instructionto close the human-computer interaction portal. Alternatively, theterminal device generates an instruction to close the human-computerinteraction portal in response to the wait timeout, and the videostreaming browser or application closes the human-computer interactionportal in response to the instruction.

As described above, the second video stream different from the firstvideo stream may be played in a manner that includes, but is not limitedto, automatically playing the second video stream in response toacquisition of the target feedback information or the instruction toclose the human-computer interaction portal.

In the method according to the embodiment of the present disclosure, thevideo stream is resumed playing in response to the acquisition of thetarget feedback information or the instruction to close thehuman-computer interaction portal. That is, the applicationautomatically plays the second video stream after the user inputs thetarget feedback information and submits the target feedback informationor closes the human-computer interaction portal, thereby achieving aseamless transition between human-computer interaction and automaticcontinuation of playing as well as improving the smoothness of videoplaying.

On the basis of any one of the described method embodiments, the secondvideo stream played and/or at least one video stream played isdetermined based on the target feedback information.

In one implementation, the second video stream to be played and/or theat least one video stream to be played are determined by the presentterminal device based on the target feedback information. Accordingly,in the method according to embodiments of the present disclosure, thetarget feedback information may be inputted into a pre-trained videostream recommendation model, and the second video stream to be playedand/or the at least one video stream to be played may be determinedbased on an output of the video stream recommendation model. Theterminal device plays the determined second video stream to be playedand/or the at least one video stream to be played after acquiring thetarget feedback information through the human-computer interactionportal or receiving the instruction to close the human-computerinteraction portal. In another implementation, the terminal device sendsthe acquired target feedback information to a network device in realtime. The network device or other terminal device (to which the networkdevice forwards the target feedback information in real time) determinesthe second video stream to be played and/or the at least one videostream to be played and sends the determination to this terminal devicein real time. This terminal device plays the determined second videostream to be played and/or the at least one video stream to be playedafter acquiring the target feedback information through thehuman-machine interaction portal or receiving the instruction to closethe human-machine interaction portal.

In one implementation, the video stream to be played is maintainedthrough a video stream sequence and the video stream to be played in thevideo stream sequence is pre-loaded. After acquiring the video stream tobe played (including the second video stream and/or at least one videostream to be played afterwards) determined based on the target feedbackinformation, the terminal device may add the acquired video stream tothe video stream sequence. The terminal device may add the acquiredvideo stream to be played to the forefront of the video stream sequenceso as to play the video stream in priority. In another implementation,the terminal device determines and loads the video stream to be playedin real time rather than maintains the video stream to be played throughthe video stream sequence or even pre-loads the video stream to beplayed. When acquiring the video stream to be played determined based onthe target feedback information, the terminal device loads and plays thevideo stream in real time.

In the method according to an embodiment of the present disclosure, thevideo stream recommendation is performed in real time based on thetarget feedback information. The faster the operation speed of therecommendation model based on the target feedback information, thefaster the video stream recommendation is performed based on the targetfeedback information in order to resume playing the video stream,thereby improving the real-time performance of the video recommendationand the user experience. If the human-computer interaction portal isused to acquire the user interest in the first video stream, the targetfeedback information may truly reflect the user interest in the firstvideo stream. Therefore, real-time video stream recommendation based onthe target feedback information may also improve the accuracy of videorecommendation.

In addition, in the method according to the embodiment of the presentdisclosure, it is necessary to acquire richer feedback content in orderto make video stream recommendation in real time based on the targetfeedback information. For example, multiple feedback questions are setor multiple options are set for a feedback question, which requires arelatively larger display area. As described above, in the methodaccording to the embodiment of the present disclosure, the playing ofthe video stream is stopped when the human-computer interaction portalis invoked. Therefore, in the case where the human-computer interactionportal is a human-computer interaction interface, the playing effect ofthe video stream is not affected by blocking the video stream beingplayed even if a relatively large human-computer interaction interfaceis displayed.

On the basis of any one of the method embodiments, the trigger conditionfor invoking the human-computer interaction portal may include at leastone of: a video stream switching instruction is received; the number oftimes that the first video stream is played repeatedly reaches athreshold; playing of the first video stream is finished already; aspecific target object is detected during the playing of the first videostream; and the first video stream is played to a certain time instant.

In practice, the method according to the embodiment of the presentdisclosure supports flexible configuration of the trigger condition forinvoking the human-computer interaction portal, so as to meet productrequirements.

In one embodiment, the received video stream switching instruction maybe configured as a trigger condition for invoking the human-computerinteraction portal. When receiving the video stream switchinginstruction, the terminal device invokes the human-computer interactionportal in response to the instruction. This implementation of invokingthe human-computer interaction portal not only does not interrupt thefirst video stream being played, but also reduces the interference tothe playing of the video stream. The video stream switching instructionincludes, but is not limited to: a touch gesture for instructing toswitch the video stream, a voice instruction for instructing to switchthe video stream, and a facial feature for instructing to switch thevideo stream.

In one embodiment, the completion of playing the first video stream maybe configured as a trigger condition for invoking the human-computerinteraction portal. When the playing of the first video stream isfinished already, the terminal device invokes the human-computerinteraction portal. This implementation of invoking the human-computerinteraction portal not only does not interrupt the first video streambeing played, but also reduces the interference to the playing of thevideo stream. In practice, the condition for the completion of playingthe first video stream may be defined as required. Taking a short videoas an example, the playing control logic of the short video is asfollows. In a non-automatic playing mode, no switching instruction orother instruction for interrupting playing (such as an interface jumpinstruction) is received. When the first playing of the first videostream is finished, the terminal device automatically plays the currentvideo repeatedly. In this case, it is defined that the completion ofplaying the first video stream refers to the completion of the firstplaying of the first video stream. In an automatic playing mode, noswitching instruction or other instruction for interrupting playing(such as interface jump instruction) is received. When the first videostream is played this time, the terminal device automatically plays anext video. In this case, it is defined that the completion of playingthe first video stream means that the current playing of the first videostream is completed.

Taking a short video playing application as an example, in thenon-automatic playing mode, the playing control process of a videostream is commonly as follows. If no video stream switching instructionis detected, the current video stream is played repeatedly. When a videostream switching instruction is detected, the current video stream isswitched to a next video stream. For the scenario where the currentvideo stream is played repeatedly with no video stream switchinginstruction being received, the repetition of the current video streammay be resulted from that the user attention is diverted and the userforgets to switch or close the video stream rather than means that theuser is more interested in the content of the video stream. In thiscase, the number of repetitions of the first video stream reaching thethreshold may also be configured as the trigger condition for invokingthe human-computer interaction portal. If the number of times that thefirst video stream is played repeatedly reaches the threshold, theterminal device requests the user feedback information about the videostream by invoking the human-computer interaction portal. This can notonly improve the accuracy of the feedback information, but also improvethe accuracy of subsequent use of the feedback information, for example,improve the accuracy of video stream recommendation based on thefeedback information provided by the user.

In one embodiment, the detection of a specific target object during theplaying of the first video stream may be configured as a triggercondition for invoking the human-computer interaction portal, so thatthe feedback information can be acquired for the specific target objectin the video stream. In one embodiment, the terminal device detects inreal time whether the specific target object is included in the videostream that is currently played. When the specific target object isdetected during the playing of the first video stream, thehuman-computer interaction portal is invoked to acquire the user targetfeedback information. In the method according to the embodiment of thepresent disclosure, the timing of determining whether the first videostream includes the specific target object is not limited. For example,it may be determined when the browser or application loads or startsplaying the first video stream whether the first video stream includesthe specific target object. In the method according to the embodiment ofthe present disclosure, the implementation manner for determiningwhether the first video stream includes the specific target object isalso not limited. For example, the terminal device may read taginformation of the first video stream, and determine whether the firstvideo stream includes a specific foreground target according to the taginformation.

In one embodiment, playing the first video stream to a specific timeinstant may be set as a trigger condition for invoking thehuman-computer interaction portal. For example, for a first video streamwith a duration of t, the specific time instant may be set to 0.5t. Thatis, when the first video stream is played to 0.5t, the human-computerinteraction portal is invoked to acquire the user target feedbackinformation. It should be understood by those skilled in the art thatthe specific time instant may be set according to actual requirements,which is not limited in the present disclosure.

In addition, other trigger condition for invoking the human-computerinteraction portal may be flexibly configured according to productrequirements. For example, the following trigger condition for invokingthe human-computer interaction portal are set, including: when aspecified touch gesture is detected in a specified touch area, when aspecified voice instruction is detected, when a specified facial featureis detected and the like, which are not limited in the presentdisclosure.

In order to control the frequency of acquiring feedback information soas to reduce the impact on the playing of the video stream, aprohibition time period is set during which the human-computerinteraction portal is not invoked. Correspondingly, a current timeinstant being not within the prohibition time period is configured as aprecondition that the trigger condition for invoking the human-computerinteraction portal is satisfied.

Optionally, if the precondition that the trigger condition for invokingthe human-computer interaction portal is satisfied includes: not withinthe prohibition time period. The method according to an embodiment ofthe present disclosure may further include: detecting a trigger eventfor the prohibition time period, and resetting a timer corresponding tothe prohibition time period. If the timer does not time out, the currenttime instant is within the forbidden time period. If the timer timesout, the current time instant is not within the forbidden time period.

The prohibition time period may include multiple time periods, each timeperiod may correspond to a timer, and each time period may correspond toa trigger event.

In practice, the trigger event for the forbidden time period may beflexibly configured according to the product requirements. In animplementation, it is assumed that for the same viewing object, thehuman-computer interaction portal is invoked only once within X days torequest the target feedback information. Correspondingly, the triggerevent corresponding to the forbidden time period includes: resetting thetimer corresponding to the time period after the target feedbackinformation is acquired. In another implementation, it is assumed thatfor the same viewing object, the viewing object has not input the targetfeedback information for Y times in succession or accumulatively.Correspondingly, the trigger event corresponding to the forbidden timeperiod includes: the number of times that no target feedback informationis inputted in succession or accumulatively reaches a set threshold.Correspondingly, after each time the human-computer interaction portalis invoked, the counted value for no target feedback information isacquired in succession or accumulatively is increased if no targetfeedback information is acquired by invoking the human-computerinteraction portal this time. When the counted value reaches the setthreshold, the timer corresponding to the time period is reset.

In the method according to the embodiment of the present disclosure,whether the current time instant is within the prohibition time periodis determined by determining whether the timer corresponding to theprohibition time period times out, which is implemented in a simple andeffective manner. The timer is reset by detecting the trigger eventcorresponding to the prohibition time period, thereby ensuring that thetimer accurately reflects the prohibition time period.

In addition to determining whether the current instant is within theforbidden time period by setting a timer, the forbidden time period mayalso be set by setting the identification information corresponding tothe forbidden time period. In an implementation manner, it is assumedthat for the same viewing object, the human-computer interaction portalis invoked only once within X days to request the target feedbackinformation. Correspondingly, prohibition identification information isset, and the prohibition identification information indicates whetherthe target feedback information of the current viewing object has beenacquired within a predetermined time period (for example, within Xdays). In another implementation, it is assumed that the same viewingobject has not input the target feedback information for Y times insuccession or accumulatively. Correspondingly, prohibitionidentification information is set, and the prohibition identificationinformation indicates whether the current viewing object has not inputthe target feedback information for Y times in succession oraccumulatively.

A method for controlling playing of a video stream is also providedaccording to an embodiment of the present disclosure. The method may beimplemented by a terminal device installed with a browser or a videostreaming application program. As shown in FIG. 4 , the method includesthe following steps 401 to 402.

In step 401, when the trigger condition for invoking the human-computerinteraction portal is satisfied, the human-computer interaction portalfor the first video stream that is currently played is invoked, so as toacquire target feedback information through the human-computerinteraction portal. The target feedback information includes feedbackinformation about the first video stream.

In step 402, a second video stream different from the first video streamis played after the target feedback information is acquired. The secondvideo stream and/or at least one video stream to be played is determinedbased on the target feedback information.

In the method according to the embodiment of the present disclosure,video stream recommendation is performed in real time based on thetarget feedback information. The faster the operation speed of arecommendation model that is based on the target feedback information,the faster the video stream recommendation is performed based on thetarget feedback information in order to resumed playing the videostream, so as to improve the real-time performance and user experienceof the video recommendation. If the human-computer interaction portal isinvoked to acquire the user interest in the first video stream, thetarget feedback information truly reflects the user interest in thefirst video stream. Therefore, real-time video stream recommendationbased on the target feedback information can also improve the accuracyof video recommendation.

Optionally, the implementation manner of playing the second video streamdifferent from the first video stream after the target feedbackinformation is acquired includes, but is not limited to: automaticallyplaying the second video stream in response to acquisition of the targetfeedback information.

In the method according to the embodiment of the present disclosure, thesecond video stream is automatically played in response to acquisitionof the target feedback information, which can realize seamlesstransition between the human-computer interaction and the operation ofautomatically playing the video stream, and improve the smoothness ofvideo playing. Moreover, in a case that the second video stream isdetermined based on the target feedback information, the user feedbackcan be quickly responded according to the embodiments of the presentdisclosure, thereby further improving the user experience.

Optionally, in the method according to the embodiment of the presentdisclosure, the first video stream is stopped playing when the triggercondition for invoking the human-computer interaction portal issatisfied. Correspondingly, the implementation of invoking thehuman-computer interaction portal may include invoking thehuman-computer interaction portal while displaying the playing window ofthe first video stream. In addition, after receiving the instruction toclose the human-computer interaction portal, the human-computerinteraction portal is closed, and the second video stream is played.

In the method according to the embodiment of the present disclosure, thehuman-computer interaction portal for the first video stream is invokedto acquire feedback information while displaying the playing window ofthe first video stream, so as to acquire the user feedback on the firstvideo stream that is currently played accurately and pertinently. Inaddition, during the process of invoking the human-computer interactionportal, the playing of the video stream is paused (i.e., the first videostream is stopped playing), and the video stream is resumed playing(i.e., the second video stream is played) after the human-computerinteraction portal is closed. Even if the human-computer interactionportal blocks the playing window of the first video stream, the playingeffect is not affected by blocking the video stream that is currentlyplayed because the playing of the first video stream is stopped duringthis process. In addition, the playing window of the stopped first videostream is still displayed since the first video stream is stoppedplaying, thereby prompting the user that the invoked human-computerinteraction portal is for the first video stream, and the prompt isintuitive and simple.

For the specific implementation manner and details of the method,reference may be made to the description of the method embodiment, whichare not repeated here.

Based on the same inventive concept as the method shown in FIG. 1 , aterminal device is provided according to an embodiment of the presentdisclosure. As shown in FIG. 5 , the terminal device may include a firstplaying control module 501 and a second playing control module 502. Thevarious components may respectively execute various steps/functions ofthe method for controlling playing of a video stream described above inconjunction with FIGS. 1 to 3 . Only main functions of the components ofthe terminal device are described below, and the details that have beendescribed above are omitted.

The first playing control module 501 is configured to stop playing afirst video stream when a trigger condition for invoking ahuman-computer interaction portal is satisfied, and invoke thehuman-computer interaction portal for the first video stream while aplaying window of the first video stream is displayed.

The second playing control module 502 is configured to close thehuman-computer interaction portal and play a second video streamdifferent from the first video stream on receipt of a target feedbackinformation through the human-computer interaction portal or aninstruction to close the human-computer interaction portal. The targetfeedback information includes feedback information about the first videostream.

With the terminal device according to the embodiment of the presentdisclosure, the human-computer interaction portal for the first videostream is invoked while the playing window of the first video stream isdisplayed, to acquire feedback information, so as to accurately andpertinently acquire the user feedback on the first video stream that iscurrently played. Moreover, during the process of invoking thehuman-computer interaction portal, the playing of the video stream ispaused (i.e., the first video stream is stopped playing), and the videostream is resumed playing (i.e., the second video stream is played)after the human-computer interaction portal is closed. Even if thehuman-computer interaction portal blocks the playing window of the firstvideo stream, the playing effect is not affected by blocking the videostream that is currently played because the playing of the first videostream is stopped during this process. In addition, the playing windowof the stopped first video stream is still displayed since the firstvideo stream is stopped playing, thereby prompting the user that theinvoked human-computer interaction portal is for the first video stream,and the prompt is intuitive and simple.

Optionally, the second playing control module plays the second videostream different from the first video stream in a manner that includes,but is not limited to, automatically playing the second video stream inresponse to acquisition of the target feedback information or theinstruction to close the human-computer interaction portal.

The terminal device according to the embodiment of the presentdisclosure resumes playing the video stream in response to theacquisition of the target feedback information or the instruction toclose the human-computer interaction portal. That is, the applicationautomatically plays the second video stream after the user inputs thetarget feedback information or closes the human-computer interactionportal, thereby achieving a seamless transition between human-computerinteraction and automatic continuation of playing as well as improvingthe smoothness of video playing.

On the basis of any one of the described terminal device embodiments,the second video stream played and/or at least one video stream playedis determined based on the target feedback information.

The terminal device according to the embodiment of the presentdisclosure performs video stream recommendation in real time based onthe target feedback information. The faster the operation speed of therecommendation model based on the target feedback information, thefaster the video stream recommendation is performed based on the targetfeedback information in order to resume playing the video stream,thereby improving the real-time performance of the video recommendationand the user experience. If the human-computer interaction portal isused to acquire the user interest in the first video stream, the targetfeedback information may truly reflect the user interest in the firstvideo stream. Therefore, real-time video stream recommendation based onthe target feedback information may also improve the accuracy of videorecommendation.

In addition, with the terminal device according to the embodiment of thepresent disclosure, it is necessary to acquire richer feedback contentin order to make video stream recommendation in real time based on thetarget feedback information. For example, multiple feedback questionsare set or multiple options are set for a feedback question, whichrequires a relatively larger display area. As described above, with theterminal device according to the embodiment of the present disclosure,the playing of the video stream is stopped when the human-computerinteraction portal is invoked. Therefore, in the case where thehuman-computer interaction portal is a human-computer interactioninterface, the playing effect of the video stream is not affected byblocking the video stream being played even if a relatively largehuman-computer interaction interface is displayed.

On the basis of any one of the terminal device embodiments, the triggercondition for invoking the human-computer interaction portal may includeat least one of: a video stream switching instruction is received; thenumber of times that the first video stream is played repeatedly reachesa threshold; playing of the first video stream is finished already; aspecific target object is detected during the playing of the first videostream; and the first video stream is played to a certain time instant.

In practice, the terminal device according to the embodiment of thepresent disclosure supports flexible configuration of the triggercondition for invoking the human-computer interaction portal, so as tomeet product requirements.

In one embodiment, the terminal device may configure the received videostream switching instruction as the trigger condition for invoking thehuman-computer interaction portal. When receiving the video streamswitching instruction, the terminal device invokes the human-computerinteraction portal in response to the instruction. This implementationmanner of invoking the human-computer interaction portal does notinterrupt the first video stream being played, thereby reducing theinterference to the playing of the video stream.

In one embodiment, the terminal device may configure the completion ofplaying the first video stream as the trigger condition for invoking thehuman-computer interaction portal. When the playing of the first videostream is finished, the terminal device invokes the human-computerinteraction portal. This implementation of invoking the human-computerinteraction portal does not interrupt the first video stream beingplayed, thereby reducing the interference to the playing of the videostream.

Taking a short video playing application as an example, in thenon-automatic playing mode, the playing control process of a videostream is commonly as follows. If no video stream switching instructionis detected, the current video stream is played repeatedly. When a videostream switching instruction is detected, the current video stream isswitched to a next video stream. For the scenario where the currentvideo stream is played repeatedly with no video stream switchinginstruction being received, the repetition of the current video streammay be resulted from that the user attention is diverted and the userforgets to switch or close the video stream rather than means that theuser is more interested in the content of the video stream. In thiscase, in one embodiment, the terminal device may configure the number ofrepetitions of the first video stream reaching the threshold as thetrigger condition for invoking the human-computer interaction portal. Ifthe number of times that the first video stream is played repeatedlyreaches the threshold, the terminal device requests the user feedbackinformation about the video stream by invoking the human-computerinteraction portal. This can not only improve the accuracy of thefeedback information, but also improve the accuracy of subsequent use ofthe feedback information, for example, improve the accuracy of videostream recommendation based on the feedback information provided by theuser.

In one embodiment, the detection of a specific target object during theplaying of the first video stream may be configured as a triggercondition for invoking the human-computer interaction portal, so thatthe feedback information can be acquired for the specific target objectin the video stream. In one embodiment, the terminal device detects inreal time whether the specific target object is included in the videostream that is currently played. When the specific target object isdetected during the playing of the first video stream, thehuman-computer interaction portal is invoked to acquire the user targetfeedback information. In the method according to the embodiment of thepresent disclosure, the timing of determining whether the first videostream includes the specific target object is not limited. For example,it may be determined when the browser or application loads or startsplaying the first video stream whether the first video stream includesthe specific target object. In the method according to the embodiment ofthe present disclosure, the implementation manner for determiningwhether the first video stream includes the specific target object isalso not limited. For example, the terminal device may read taginformation of the first video stream, and determine whether the firstvideo stream includes a specific foreground target according to the taginformation.

In one embodiment, playing the first video stream to a specific timeinstant may be set as a trigger condition for invoking thehuman-computer interaction portal. For example, for a first video streamwith a duration of t, the specific time instant may be set to 0.5t. Thatis, when the first video stream is played to 0.5t, the human-computerinteraction portal is invoked to acquire the user target feedbackinformation. It should be understood by those skilled in the art thatthe specific time instant may be set according to actual requirements,and is not limited in the present disclosure.

In addition, other trigger condition for invoking the human-computerinteraction portal may be flexibly configured according to productrequirements. For example, the following trigger condition for invokingthe human-computer interaction portal are set, including: when aspecified touch gesture is detected in a specified touch area, when aspecified voice instruction is detected, when a specified facial featureis detected and the like, which are not limited herein.

In order to control the frequency of acquiring feedback information soas to reduce the impact on the playing of the video stream, aprohibition time period is set during which the human-computerinteraction portal is not invoked. Correspondingly, a current timeinstant being not within the prohibition time period is configured as aprecondition that the trigger condition for invoking the human-computerinteraction portal is satisfied.

Optionally, if the precondition that the trigger condition for invokingthe human-computer interaction portal is satisfied includes: not withinthe prohibition time period. The method according to an embodiment ofthe present disclosure may further include: detecting a trigger eventfor the prohibition time period, and resetting a timer corresponding tothe prohibition time period. If the timer does not time out, the currenttime instant is within the forbidden time period. If the timer timesout, the current time instant is not within the forbidden time period.

Based on the same inventive concept as the method shown in FIG. 4 , aterminal device is provided according to an embodiment of the presentdisclosure. As shown in FIG. 6 , the terminal device may include ahuman-computer interaction portal invocation module 601 and a videostream playing module 602. The various components may respectivelyexecute various steps/functions of the method for controlling playing ofa video stream described above in conjunction with FIGS. 2 to 4 . Onlymain functions of the components of the terminal device are describedbelow, and the details that have been described above are omitted.

The human-computer interaction portal invocation module 601 isconfigured to invoke a human-computer interaction portal for a firstvideo stream that is currently played when the trigger condition forinvoking the human-computer interaction portal is satisfied, so as toacquire target feedback information through a human-computer interactioninterface. The target feedback information includes feedback informationabout the first video stream.

The video stream playing module 602 is configured to play a second videostream different from the first video stream after the target feedbackinformation is acquired. The second video stream and/or at least onevideo stream to be played is determined based on the target feedbackinformation.

The terminal device according to the embodiment of the presentdisclosure performs video stream recommendation in real time based onthe target feedback information. The faster the operation speed of arecommendation model that is based on the target feedback information,the faster the video stream recommendation is performed based on thetarget feedback information when the video stream is resumed playing, soas to improve the real-time performance and user experience of the videorecommendation. If the human-computer interaction portal is invoked toacquire the level of user interest in the first video stream, the targetfeedback information truly reflects the level of user interest in thefirst video stream. Therefore, real-time video stream recommendationbased on the target feedback information can also improve the accuracyof video recommendation.

Optionally, the video stream playing module plays the second videostream different from the first video stream after the target feedbackinformation is acquired in a manner including but not limited to:automatically playing the second video stream in response to acquisitionof the target feedback information.

The terminal device according to the embodiment of the presentdisclosure automatically plays the second video stream in response toacquisition of the target feedback information, which can realizeseamless transition between the human-computer interaction and theoperation of automatically playing the video stream, and improve thesmoothness of video playing.

Optionally, the video stream playing module further stops playing thefirst video stream when the trigger condition for invoking thehuman-computer interaction portal is satisfied. Correspondingly, theimplementation manner of invoking the human-computer interaction portalby the human-computer interaction portal invocation module may includeinvoking the human-computer interaction portal while displaying theplaying window of the first video stream. In addition, after receivingthe instruction to close the human-computer interaction portal, thehuman-computer interaction portal vocation module closes thehuman-computer interaction portal, and the video stream playing moduleplays the second video stream. The target feedback information includesfeedback information about the first video stream.

The terminal device according to the embodiment of the presentdisclosure stops playing the video stream (i.e., stops playing the firstvideo stream) during the process of invoking the human-computerinteraction portal, and automatically resumes playing the video stream(i.e., plays the second video stream) after closing the human-computerinteraction portal. Even if the human-computer interaction portal blocksthe playing window of the first video stream, the playing effect is notaffected by blocking the video stream that is currently played becausethe playing of the first video stream is stopped during this process. Inaddition, the playing window of the stopped first video stream is stilldisplayed since the first video stream is stopped playing, therebyprompting the user that the invoked human-computer interaction portal isfor the first video stream, and the prompt is intuitive and simple.

The modules may be implemented as software components executing on oneor more general-purpose processors, or as hardware, such as programmablelogic devices and/or application-specific integrated circuits, thatperform certain functions or combinations thereof. In some embodiments,the modules may be embodied in the form of a software product. Thesoftware product may be stored on a non-volatile storage medium. Thenon-volatile storage medium includes a program for causing a computerdevice (e.g., a personal computer, a server, a network device, and amobile terminal) to implement the methods described in the embodimentsof the present disclosure. In one embodiment, the modules may beimplemented on a single device, or distributed on multiple devices. Thefunctions of the modules may be combined with each other or each may besplit into multiple sub-modules.

The terminal device in the various embodiments may include a smartphone, a palmtop computer, a tablet computer, a wearable device with adisplay screen, an on-board computer, a smart speaker, a personalcomputer, and the like.

Those skilled in the art should clearly understand that, for thespecific operation process of the modules of the terminal devicedescribed, reference may be made to the corresponding processes in theforegoing method embodiments for the convenience and brevity of thedescription, which are not repeated here.

Based on the same inventive concept as the method, an electronic deviceis also provided according to an embodiment of the present disclosure.The electronic device includes a processor and a memory.

The memory is configured to store programs for implementing the methodsdescribed in the various method embodiments. The processor is configuredto execute programs stored in the memory. When program instructionsstored in the memory are executed by the processor, the processorimplements the methods described in the various method embodiments, andis further configured to implement corresponding modules in the terminaldevice according to the embodiments of the present disclosure. Theprocessor may be a processing unit including a central processing unit(CPU) or other form of processing unit capable of processing data and/orexecuting instructions, and may control other components in theelectronic device to perform desired functions. The memory may includeone or more computer program products. The computer program product mayinclude various forms of computer-readable storage media, such as avolatile memory and/or a non-volatile memory. The volatile memory mayinclude, for example, a random-access memory (RAM) and/or a cachememory, or the like. The non-volatile memory may include, for example, aread only memory (ROM), a hard disk, a flash memory, and the like. Oneor more computer program instructions may be stored on thecomputer-readable storage medium. The processor may execute the programinstructions to implement the functions of the embodiments of thepresent disclosure described above and/or other desired functions.

In a case that the electronic device according to the embodiment of thepresent disclosure implements the method shown in FIG. 1 , thehuman-computer interaction portal for the first video stream is invokedwhile the playing window of the first video stream is displayed, toacquire the feedback information, thereby acquiring the user feedback onthe first video stream that is currently played accurately andpertinently. Moreover, during the process of invoking the human-computerinteraction portal, the playing of the video stream is paused (i.e., thefirst video stream is stopped playing), and the video stream is resumedplaying (i.e., the second video stream is played) after thehuman-computer interaction portal is closed. Even if the human-computerinteraction portal blocks the playing window of the first video stream,the playing effect is not affected by blocking the video stream that iscurrently played because the playing of the first video stream isstopped during this process. In addition, the playing window of thestopped first video stream is still displayed since the first videostream is stopped playing, thereby prompting the user that the invokedhuman-computer interaction portal is for the first video stream, and theprompt is intuitive and simple.

In a case that the electronic device according to the embodiment of thepresent disclosure implements the method shown in FIG. 5 , the videostream recommendation is performed in real time based on the targetfeedback information. The faster the operation speed of therecommendation model based on the target feedback information, thefaster the video stream recommendation is performed based on the targetfeedback information in order to resume playing the video stream,thereby improving the real-time performance of the video recommendationand the user experience. If the human-computer interaction portal isused to acquire the user interest in the first video stream, the targetfeedback information may truly reflect the user interest in the firstvideo stream. Therefore, real-time video stream recommendation based onthe target feedback information may also improve the accuracy of videorecommendation.

The electronic device according to the embodiment of the presentdisclosure may include a terminal device. As shown in FIG. 7 , theterminal device includes not only a processor and a memory, but also aninput device (such as a touch screen, a camera, and a microphone), anoutput device (such as a display screen, and a speaker), a communicationmodule, and a power supply module.

The memory, the input device, the output device, the communicationmodule, and the power supply module are connected to the processorthrough a serial port, a bus or a USB interface. For a single-processorterminal device, the processor is the CPU (central processing unit). Fora dual-processor terminal device, the processor includes a masterprocessor and a slave processor. The master processor runs anapplication program to implement the method according to the embodimentof the present disclosure. The slave processor controls thecommunication module to cooperate to implement communication with theoutside. For a terminal device includes a GPU (graphics processing unit)and a CPU, the processor includes the GPU and the CPU. The methodsaccording to the embodiments of the present disclosure are implementedby the GPU and the CPU in cooperation or by the CPU alone.

The memory of the terminal device may include, but is not limited to, aflash memory, a RAM (random-access memory), a ROM (read only memory) andthe like. The RAM is mainly configured to store programs and data whenthe terminal device is running. The program to be executed or the datato be processed must be loaded into the RAM first. The ROM is mainlyconfigured to check the configuration of an operating system of theterminal device and provide the most basic input and output (I/O)programs. The flash memory is a long-life non-volatile (retains storeddata information in the event of a power failure) memory. Data deletionis in units of fixed blocks rather than single bytes. Since the flashmemory still stores data during power failure, the flash memory isusually configured to store setting information, such as the settinginformation of the user on the mobile phone.

It should be noted that a terminal device (such as a single-processorsmartphone, a dual-processor smartphone, a smart wearable device, and atablet computer) may include more or less hardware structures than theterminal device shown in FIG. 7 . However, a terminal device including amemory and a processor and capable of implementing the functions of themethod embodiments should falls within the protection scope of thepresent disclosure.

Based on the same inventive concept as the method, a computer-readablestorage medium storing a program for executing the method in the variousembodiments is further provided according to an embodiment of thepresent disclosure.

In a case that the program stored in the computer-readable storagemedium according to the embodiment of the present disclosure implementsthe method shown in FIG. 1 , the playing of the video stream is paused(i.e., the first video stream is stopped playing) during the process ofinvoking the human-computer interaction portal, and the video stream isresumed playing (i.e., the second video stream is played) after thehuman-computer interaction portal is closed. Even if the human-computerinteraction portal blocks the playing window of the first video stream,the playing effect is not affected by blocking the video stream that iscurrently played because the playing of the first video stream isstopped during this process. In addition, the playing window of thestopped first video stream is still displayed since the first videostream is stopped playing, thereby prompting the user that the invokedhuman-computer interaction portal is for the first video stream, and theprompt is intuitive and simple.

In a case that the program stored in the computer-readable storagemedium according to the embodiment of the present disclosure implementsthe method shown in FIG. 4 , the video stream recommendation isperformed in real time based on the target feedback information. Thefaster the operation speed of the recommendation model based on thetarget feedback information, the faster the video stream recommendationis performed based on the target feedback information in order to resumeplaying the video stream, thereby improving the real-time performance ofthe video recommendation and the user experience. If the human-computerinteraction portal is used to acquire the user interest in the firstvideo stream, the target feedback information may truly reflect the userinterest in the first video stream. Therefore, real-time video streamrecommendation based on the target feedback information may also improvethe accuracy of video recommendation.

The specification is described with reference to flowchart illustrationsand/or block diagrams of methods, devices (systems), and computerprogram products according to embodiments in the specification. Itshould be understood that each flow and/or block in the flowchartsand/or block diagrams, and combinations of flows and/or blocks in theflowcharts and/or block diagrams, may be implemented by computer programinstructions. These computer program instructions may be provided to aprocessor of a general purpose computer, a special purpose computer, anembedded processor or other programmable data processing device toproduce a machine, so that instructions executed by a processor of acomputer or other programmable data processing device produce means forimplementing the functions specified in a flow or flows of theflowcharts and/or a block or blocks in the block diagrams.

The computer program instructions may also be stored in acomputer-readable memory capable of directing a computer or otherprogrammable data processing device to function in a particular manner,such that the instructions stored in the computer-readable memory resultin an article of manufacture including an instruction device. Theinstruction device implements the functions specified in a flow or flowsin the flowcharts and/or a block or blocks in the block diagrams.

These computer program instructions may also be loaded onto a computeror other programmable data processing device so that a series ofoperational steps are performed on the computer or other programmabledevice to produce computer-implemented processing. Therefore, theinstructions executing on a computer or other programmable deviceprovide steps for implementing the functions specified in a flow orflows in the flowcharts and/or a block or blocks in the block diagrams.

Although preferred embodiments of this specification have beendescribed, additional changes and modifications to these embodiments maymade by those skilled in the art once the basic inventive concepts areknown. Therefore, the appended claims are intended to be construed toinclude the preferred embodiments and all changes and modifications thatfall within the scope of this specification.

It is apparent that those skilled in the art may make various changesand modifications to this specification without departing from thespirit and scope of this specification. Therefore, provided that thesemodifications and variations of the present specification fall withinthe scope of the claims of the present specification and their technicalequivalents, the present specification is also intended to include thesemodifications and variations.

The invention claimed is:
 1. A method for controlling playing of a videostream, comprising: stopping playing a first video stream when a triggercondition for invoking a human-computer interaction portal is satisfied,and invoking the human-computer interaction portal for the first videostream while displaying a playing window of the first video stream; andclosing the human-computer interaction portal and playing a second videostream different from the first video stream on receipt of targetfeedback information through the human-computer interaction portal or onreceipt of an instruction to close the human-computer interactionportal, wherein the target feedback information comprises feedbackinformation about the first video stream, and wherein the playing asecond video stream different from the first video stream furthercomprises: receiving the target feedback information through thehuman-computer interaction portal, wherein the human-computerinteraction portal is configured to acquire user feedback on the firstvideo stream, and wherein the target feedback information indicates alevel of user interest in the first video stream, performing real-timevideo stream recommendation by inputting the target feedback informationinto a pre-trained video stream recommendation model, determining thesecond video stream based on an output of the video streamrecommendation model, and automatically playing the second video stream.2. The method according to claim 1, wherein the second video stream orat least one video stream to be played is determined based on the targetfeedback information.
 3. The method according to claim 1, wherein thetrigger condition for invoking the human-computer interaction portalcomprises at least one of: a video stream switching instruction isreceived; the number of times that the first video stream is playedrepeatedly reaches a threshold; playing of the first video stream isfinished already; a specific target object is detected during playing ofthe first video stream; and the first video stream is played to acertain time instant.
 4. A method for controlling playing of a videostream, comprising: invoking a human-computer interaction portal for afirst video stream that is currently played when a trigger condition forinvoking the human-computer interaction portal is satisfied, to acquiretarget feedback information through the human-computer interactionportal, wherein the target feedback information comprises feedbackinformation about the first video stream; and playing a second videostream different from the first video stream after the target feedbackinformation is acquired, wherein the second video stream or at least onevideo stream to be played is determined based on the target feedbackinformation, and wherein the playing a second video stream differentfrom the first video stream after the target feedback information isacquired further comprises: receiving the target feedback informationthrough the human-computer interaction portal, wherein thehuman-computer interaction portal is configured to acquire user feedbackon the first video stream, and wherein the target feedback informationindicates a level of user interest in the first video stream, performingreal-time video stream recommendation by inputting the target feedbackinformation into a pre-trained video stream recommendation model,determining the second video stream based on an output of the videostream recommendation model, and automatically playing the second videostream.
 5. A terminal device, comprising: a memory; and a processorconfigured to execute the program stored in the memory; wherein thememory is configured to store a program for implementing operationscomprising: stopping playing a first video stream when a triggercondition for invoking a human-computer interaction portal is satisfied,and invoking the human-computer interaction portal for the first videostream while a playing window of the first video stream is displayed;and closing the human-computer interaction portal and playing a secondvideo stream different from the first video stream on receipt of targetfeedback information through the human-computer interaction portal or onreceipt of an instruction to close the human-computer interactionportal, wherein the target feedback information comprises feedbackinformation about the first video stream, and wherein the playing asecond video stream different from the first video stream furthercomprises: receiving the target feedback information through thehuman-computer interaction portal, wherein the human-computerinteraction portal is configured to acquire user feedback on the firstvideo stream, and wherein the target feedback information indicates alevel of user interest in the first video stream, performing real-timevideo stream recommendation by inputting the target feedback informationinto a pre-trained video stream recommendation model, determining thesecond video stream based on an output of the video streamrecommendation model, and automatically playing the second video stream.6. The device according to claim 5, wherein the second video stream orat least one video stream to be played is determined based on the targetfeedback information.
 7. The device according to claim 5, wherein thetrigger condition for invoking the human-computer interaction portalcomprises at least one of: a video stream switching instruction isreceived; the number of times that the first video stream is playedrepeatedly reaches a threshold; playing of the first video stream isfinished already; a specific target object is detected during playing ofthe first video stream; and the first video stream is played to acertain time instant.
 8. A non-transitory computer-readable storagemedium, storing a program for implementing the method according to claim1.